Patch #1200

jiqing-feng · 2025-03-17T03:25:57Z

Hi @echarlaix @IlyasMoutawwakil @sywangyi .

This is our 1st step to enable patched model + torch.compile. We plan to disable all ipex fuse linear patch because torch.compile also has fuse opimization. We cannot defaultly use torch.compile for patched models because there are still incompatible issues for flash attention.

We plan to enable the patched model + torch.compile in ipex 2.7 or 2.8

This PR works for non-generation models like bert as patched bert model has no flash attention which will block the torch.compile .

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

HuggingFaceDocBuilderDev · 2025-03-17T03:31:07Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

jiqing-feng added 2 commits March 12, 2025 12:24

upgrade transformers to 4.49 for patching models

160f65c

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

update setup

1b0dc0d

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

jiqing-feng added 6 commits March 17, 2025 11:14

disable linear fusion when using compile

e3b970c

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

use max-autotune

d7af7ba

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

Merge branch 'upgrade' into patch

a27f0d1

fix compile param

733bbc4

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

fix tests

4db4db5

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

Merge branch 'main' into patch

bbf09cd

jiqing-feng marked this pull request as ready for review March 18, 2025 04:36

disable max-autotune

581aa6f

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Patch #1200

Patch #1200

jiqing-feng commented Mar 17, 2025 •

edited

Loading

HuggingFaceDocBuilderDev commented Mar 17, 2025

Patch #1200

Are you sure you want to change the base?

Patch #1200

Conversation

jiqing-feng commented Mar 17, 2025 • edited Loading

HuggingFaceDocBuilderDev commented Mar 17, 2025

jiqing-feng commented Mar 17, 2025 •

edited

Loading